160 research outputs found
WordSup: Exploiting Word Annotations for Character based Text Detection
Imagery texts are usually organized as a hierarchy of several visual
elements, i.e. characters, words, text lines and text blocks. Among these
elements, character is the most basic one for various languages such as
Western, Chinese, Japanese, mathematical expression and etc. It is natural and
convenient to construct a common text detection engine based on character
detectors. However, training character detectors requires a vast of location
annotated characters, which are expensive to obtain. Actually, the existing
real text datasets are mostly annotated in word or line level. To remedy this
dilemma, we propose a weakly supervised framework that can utilize word
annotations, either in tight quadrangles or the more loose bounding boxes, for
character detector training. When applied in scene text detection, we are thus
able to train a robust character detector by exploiting word annotations in the
rich large-scale real scene text datasets, e.g. ICDAR15 and COCO-text. The
character detector acts as a key role in the pipeline of our text detection
engine. It achieves the state-of-the-art performance on several challenging
scene text detection benchmarks. We also demonstrate the flexibility of our
pipeline by various scenarios, including deformed text detection and math
expression recognition.Comment: 2017 International Conference on Computer Visio
Integrating Relation Constraints with Neural Relation Extractors
Recent years have seen rapid progress in identifying predefined relationship
between entity pairs using neural networks NNs. However, such models often make
predictions for each entity pair individually, thus often fail to solve the
inconsistency among different predictions, which can be characterized by
discrete relation constraints. These constraints are often defined over
combinations of entity-relation-entity triples, since there often lack of
explicitly well-defined type and cardinality requirements for the relations. In
this paper, we propose a unified framework to integrate relation constraints
with NNs by introducing a new loss term, ConstraintLoss. Particularly, we
develop two efficient methods to capture how well the local predictions from
multiple instance pairs satisfy the relation constraints. Experiments on both
English and Chinese datasets show that our approach can help NNs learn from
discrete relation constraints to reduce inconsistency among local predictions,
and outperform popular neural relation extraction NRE models even enhanced with
extra post-processing. Our source code and datasets will be released at
https://github.com/PKUYeYuan/Constraint-Loss-AAAI-2020.Comment: Accepted to AAAI-202
Symmetric Pruning in Quantum Neural Networks
Many fundamental properties of a quantum system are captured by its
Hamiltonian and ground state. Despite the significance of ground states
preparation (GSP), this task is classically intractable for large-scale
Hamiltonians. Quantum neural networks (QNNs), which exert the power of modern
quantum machines, have emerged as a leading protocol to conquer this issue. As
such, how to enhance the performance of QNNs becomes a crucial topic in GSP.
Empirical evidence showed that QNNs with handcraft symmetric ansatzes generally
experience better trainability than those with asymmetric ansatzes, while
theoretical explanations have not been explored. To fill this knowledge gap,
here we propose the effective quantum neural tangent kernel (EQNTK) and connect
this concept with over-parameterization theory to quantify the convergence of
QNNs towards the global optima. We uncover that the advance of symmetric
ansatzes attributes to their large EQNTK value with low effective dimension,
which requests few parameters and quantum circuit depth to reach the
over-parameterization regime permitting a benign loss landscape and fast
convergence. Guided by EQNTK, we further devise a symmetric pruning (SP) scheme
to automatically tailor a symmetric ansatz from an over-parameterized and
asymmetric one to greatly improve the performance of QNNs when the explicit
symmetry information of Hamiltonian is unavailable. Extensive numerical
simulations are conducted to validate the analytical results of EQNTK and the
effectiveness of SP.Comment: Accepted to International Conference on Learning Representations
(ICLR) 202
Research on Intelligent Organization and Application of Multi-source Heterogeneous Knowledge Resources for Energy Internet
ABSTRACTTo improve the informationization and intelligence of the energy Internet industry and enhance the capability of knowledge services, it is necessary to organize the energy Internet body of knowledge from existing knowledge resources of the State Grid, which have the characteristics of large scale, multiple sources, and heterogeneity. At the same time, the business fields of State Grid cover a wide range. There are many sub-fields under each business field, and the relationship between fields is diverse and complex. The key to establishing the energy Internet body of knowledge is how to fuse the heterogeneous knowledge resources from multiple sources, extract the knowledge contents from them, and organize the different relationships. This paper considers transforming the original knowledge resources of State Grid into a unified and well-organized knowledge system described in OWL language to meet the requirements of heterogeneous resource integration, multi-source resource organization, and knowledge service provision. For the State Grid knowledge resources mainly in XML format, this paper proposes a Knowledge Automatic Fusion and Organization idea and method based on XSD Directed Graph. According to the method, the XML corresponding XSD documents are transformed into a directed graph in the first stage during which the graph neural network detects hidden knowledge inside the structure to add semantic information to the graph.In the second stage, for other structured knowledge resources (e.g., databases, spreadsheets), the knowledge contents and the relationships are analyzed manually to establish the mappings from structured resources to graph structures, using which the original knowledge resources are transformed into graph structures, and merged with the directed graphs obtained in the first stage to achieve the fusion of heterogeneous knowledge resources. And expert knowledge is introduced for heterogeneous knowledge fusion to further extend the directed graph. And in the third stage, the expanded directed graph is converted to the body of knowledge in the form of OWL. This paper takes the knowledge resources in the field of human resources of the State Grid as an example, to establish the ontology of the human resources training field in a unified manner, initially demonstrating the effectiveness of the proposed method
Empirical Observations of Congestion Propagation and Dynamic Partitioning with Probe Data for Large-Scale Systems
Research on congestion propagation in large urban networks has been based mainly on microsimulations of link-level traffic dynamics. However, both the unpredictability of travel behavior and the complexity of accurate physical modeling present challenges, and simulation results may be time-consuming and unrealistic. This paper explores empirical data from large-scale urban networks to identify hidden information in the process of congestion formation. Specifically, the spatiotemporal relation of congested links is studied, congestion propagation is observed from a macroscopic perspective, and critical congestion regimes are identified to aid in the design of peripheral control strategies. To achieve these goals, the maximum connected component of congested links is used to capture congestion propagation in the city. A data set of 20,000 taxis with global positioning system (GPS) data from Shenzhen, China, is used. Empirical macroscopic fundamental diagrams of congested regions observed during propagation are presented, and the critical congestion regimes are quantified. The findings show that the proposed methodology can effectively distinguish congestion pockets from the rest of the network and efficiently track congestion evolution in linear time O(n)
Transition role of entangled data in quantum machine learning
Entanglement serves as the resource to empower quantum computing. Recent
progress has highlighted its positive impact on learning quantum dynamics,
wherein the integration of entanglement into quantum operations or measurements
of quantum machine learning (QML) models leads to substantial reductions in
training data size, surpassing a specified prediction error threshold. However,
an analytical understanding of how the entanglement degree in data affects
model performance remains elusive. In this study, we address this knowledge gap
by establishing a quantum no-free-lunch (NFL) theorem for learning quantum
dynamics using entangled data. Contrary to previous findings, we prove that the
impact of entangled data on prediction error exhibits a dual effect, depending
on the number of permitted measurements. With a sufficient number of
measurements, increasing the entanglement of training data consistently reduces
the prediction error or decreases the required size of the training data to
achieve the same prediction error. Conversely, when few measurements are
allowed, employing highly entangled data could lead to an increased prediction
error. The achieved results provide critical guidance for designing advanced
QML protocols, especially for those tailored for execution on early-stage
quantum computers with limited access to quantum resources
Multi-choice Question Answering System of WIP at NTCIR-12 QA Lab-2
ABSTRACT This paper describes a multi-choice question answering system we designed for the . This system aims at analysing and answering world history multi-choice questions in the Japanese National Center Test (in English). Our system utilizes preliminary results from an information retrieval baseline as a starting point, and improves by taking structured knowledge base as well as additional time constraints into consideration. In the final evaluation, we achieved 34 points on the 2011 test dataset. Team Name WIP Subtasks National Center Tests, Formal Run (English
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Existing learning-based autonomous driving (AD) systems face challenges in
comprehending high-level information, generalizing to rare events, and
providing interpretability. To address these problems, this work employs Large
Language Models (LLMs) as a decision-making component for complex AD scenarios
that require human commonsense understanding. We devise cognitive pathways to
enable comprehensive reasoning with LLMs, and develop algorithms for
translating LLM decisions into actionable driving commands. Through this
approach, LLM decisions are seamlessly integrated with low-level controllers by
guided parameter matrix adaptation. Extensive experiments demonstrate that our
proposed method not only consistently surpasses baseline approaches in
single-vehicle tasks, but also helps handle complex driving behaviors even
multi-vehicle coordination, thanks to the commonsense reasoning capabilities of
LLMs. This paper presents an initial step toward leveraging LLMs as effective
decision-makers for intricate AD scenarios in terms of safety, efficiency,
generalizability, and interoperability. We aspire for it to serve as
inspiration for future research in this field. Project page:
https://sites.google.com/view/llm-mp
- …